Informatio

Informationen

  • Integrität der Datenquelle
  • Integrität der Darstellung

Integrität der Daten: Garbage in, garbage out

Foto von Jilbert Ebrahimi auf Unsplash.

Fehlender Kontext

Siehe storymaps.

“Land doesn’t vote … People Do.”

Siehe storymaps.

Daten in Kontext setzen

Zeichne Mittelwerte, oder hebe hervor aus einer Vergleichsmenge, z.B. europäischer Länder. Am einfachsten geht das mit einem zweiten Datensatz, der alle europäischen Länder enthält:

gapminder_line <- gapminder %>%
  filter(country %in% c("deu"), time >= 1900, time <= 2024) %>%
  drop_na(gini)

ggplot(data = gapminder_line, aes(x = time, y = gini)) +
  geom_line(aes(group = country))

Hinweis

Code
## Bilde einen eignen Datensatz, der alle europäischen Lönder enthält

gapminder_europe <- gapminder %>%
  filter(world_4region == "europe", time >= 1900, time <= 2024) %>%
  drop_na(gini)

gapminder_line <- gapminder %>%
  filter(country %in% c("deu"), time >= 1900, time <= 2024) %>%
  drop_na(gini)

gapminder_europe <- gapminder %>%
  filter(world_4region == "europe", time >= 1900, time <= 2024) %>%
  drop_na(gini)


p_line <- ggplot(data = gapminder_line, aes(x = time, y = gini)) +
  geom_line(data = gapminder_europe, aes(x = time, y = gini, group = country), color = "lightgrey", alpha = 0.5) +
  geom_smooth(color = "black") +
  geom_line(color = "#F4BA02", linewidth = 2) +
  theme_bg() +
  labs(
    title = "Ungleichheit in Europa On the Rise",
    subtitle = "1900 bis 2024, Gelbe Linie zeigt Deutschland",
    x = "Jahr",
    y = "Gini-Koeffizient"
  )

p_line

Daten in Kontext setzen

gapminder_line <- gapminder %>%
  filter(country %in% c("deu"), time >= 1900, time <= 2024) %>%
  drop_na(gini)

gapminder_europe <- gapminder %>%
  filter(world_4region == "europe", time >= 1900, time <= 2024) %>%
  drop_na(gini)


p_line <- ggplot(data = gapminder_line, aes(x = time, y = gini)) +
  geom_line(data = gapminder_europe, aes(x = time, y = gini, group = country), color = "lightgrey", alpha = 0.5) +
  geom_smooth(color = "black") +
  geom_line(color = "#F4BA02", linewidth = 2) +
  theme_bg() +
  labs(
    title = "Ungleichheit in Europa On the Rise",
    subtitle = "1900 bis 2024, Gelbe Linie zeigt Deutschland",
    x = "Jahr",
    y = "Gini-Koeffizient"
  )

p_line

Konventionen

Konventionen beachten! Die y-Achse sollte bspw. in der Regel bei 0 beginnen.

776.013 Home protection; use or threatened use of deadly force; presumption of fear of death or great bodily harm.— (1) A person who is in a dwelling or residence in which the person has a right to be has no duty to retreat and has the right to stand his or her ground and use or threaten to use:
(a) Nondeadly force against another when and to the extent that the person reasonably believes that such conduct is necessary to defend himself or herself or another against the other’s imminent use of unlawful force; or
(b) Deadly force if he or she reasonably believes that using or threatening to use such force is necessary to prevent imminent death or great bodily harm to himself or herself or another or to prevent the imminent commission of a forcible felony.

(Florida Satutes)

Manipulation des Bildformats

Reales Beispiel für Manipulation des Bildformates

The designer of the visualization, by selecting a y-axis starting point, has control over the subjective importance of the resulting differences.
(Correll et al., 2020)

Grafik aus Huff (1979), heruntergeladen von Observable blog.

Soll die Null geplottet werden?

Absolute Werte

Die Null sollte in der Regel mitgeplottet werden, da wir uns Mengen anschauen.

Veränderung der Werte

Hier ist es eher eine Ermessensfrage:



If zero is in the neighborhood, invite it in! (Andrew Gelman)



In general, in a time-series, use a baseline that shows the data not the zero point. (Tufte & Graves-Morris, 1983)

Veränderung der Werte

Schneiden von Achsen

In der Regel keine gute Idee! Verzerrt die Skala.

Correll, M., Bertini, E., & Franconeri, S. (2020). Truncating the y-axis: Threat or menace? Proceedings of the 2020 CHI conference on human factors in computing systems, 1–12.
Huff, D. (1979). How to lie with statistics / by Darrell Huff.
Tufte, E. R., & Graves-Morris, P. R. (1983). The visual display of quantitative information (Bd. 2). Graphics press Cheshire, CT.